Introduce `UserDefinedLogicalNodeUnparser` for User-defined Logical Plan unparsing #13880

goldmedal · 2024-12-22T10:07:45Z

Which issue does this PR close?

Closes #13753 .

Rationale for this change

See the previous discussion for the design: #13753 (comment)

UserDefinedLogicalNodeUnparser provides two APIs for user-defined behavior:

unparse: Unparse the custom logical node to SQL within a statement.
unparse_to_statement: Unparse the custom logical node to a statement.

What changes are included in this PR?

Introduce UserDefinedLogicalNodeUnparser
Make the AST builders public
Add examples for unparsing custom logical plan

Are these changes tested?

yes

Are there any user-facing changes?

New trait and API

goldmedal · 2024-12-22T12:24:58Z

Could @phillipleblanc or @sgrebnov take a look at this? Thanks

sgrebnov · 2024-12-22T20:32:56Z

datafusion/sql/src/unparser/mod.rs

+    /// The child unparsers are called iteratively.
+    /// There are two methods in [`Unparser`] will be called:
+    /// - `extension_to_statement`: This method is called when the custom logical node is a custom statement.
+    ///     If multiple child unparsers return a non-None value, the last unparsing result will be returned.


@goldmedal - I'm not sure using the last unparsing result is the expected behavior. As a user, I would expect to get the result from the first udlp_unparser that supports this node and stop checking the remaining udlp_unparsers instead.

Is there a specific use case / reason for using the last supported udlp_unparser? They can be dynamically registered and the last one should override perviously registered? To match unparse behavior where we don't know/track if unparsing is applied so we always apply all?

Yeah, I would also expect this to short-circuit and have the first one win.

sgrebnov · 2024-12-22T20:44:58Z

datafusion/sql/src/unparser/plan.rs

+        select: &mut Option<&mut SelectBuilder>,
+        relation: &mut Option<&mut RelationBuilder>,
+    ) -> Result<()> {
+        for unparser in &self.udlp_unparsers {


It might be good to add indication that unparse applied to be consistent with unparse_to_statement and throw error if non of registered udlps applied / successfully processed the node.

Yeah, if none of the extension unparsers were able to process it, it should throw an error IMO

sgrebnov · 2024-12-22T20:45:39Z

@goldmedal - looks great, two minor questions/comments

phillipleblanc

Thanks @goldmedal! I have a few minor comments, but this this a good upgrade for the unparser!

phillipleblanc · 2024-12-23T02:08:35Z

datafusion/sql/src/unparser/mod.rs

+    ///     If multiple child unparsers return a non-None value, the last unparsing result will be returned.
+    /// - `extension_to_sql`: This method is called when the custom logical node is part of a statement.
+    ///    If multiple child unparsers are registered for the same custom logical node, all of them will be called in order.
+    pub fn with_udlp_unparsers(


Not a fan of this name - udlp takes effort to understand what it means. How about renaming udlp_* to extension_*? i.e. with_extension_unparsers. It conveys the same meaning in an easier to understand way.

phillipleblanc · 2024-12-23T02:09:35Z

datafusion/sql/src/unparser/mod.rs

+    /// The child unparsers are called iteratively.
+    /// There are two methods in [`Unparser`] will be called:
+    /// - `extension_to_statement`: This method is called when the custom logical node is a custom statement.
+    ///     If multiple child unparsers return a non-None value, the last unparsing result will be returned.


Yeah, I would also expect this to short-circuit and have the first one win.

phillipleblanc · 2024-12-23T02:10:10Z

datafusion/sql/src/unparser/mod.rs

+    /// - `extension_to_statement`: This method is called when the custom logical node is a custom statement.
+    ///     If multiple child unparsers return a non-None value, the last unparsing result will be returned.
+    /// - `extension_to_sql`: This method is called when the custom logical node is part of a statement.
+    ///    If multiple child unparsers are registered for the same custom logical node, all of them will be called in order.


I think this should also short-circuit and only do the first one?

phillipleblanc · 2024-12-23T02:17:46Z

datafusion/sql/src/unparser/plan.rs

+        select: &mut Option<&mut SelectBuilder>,
+        relation: &mut Option<&mut RelationBuilder>,
+    ) -> Result<()> {
+        for unparser in &self.udlp_unparsers {


Yeah, if none of the extension unparsers were able to process it, it should throw an error IMO

phillipleblanc · 2024-12-23T02:18:26Z

datafusion/sql/src/unparser/udlp_unparser.rs

I would also rename this file to extension_unparser.rs

phillipleblanc · 2024-12-23T02:21:27Z

datafusion/sql/tests/cases/plan_to_sql.rs

+    if let Some(err) = plan_to_sql(&plan).err() {
+        assert_eq!(
+            err.to_string(),
+            "External error: `relation` must be initialized"


This error is expected?

goldmedal added 5 commits December 22, 2024 14:33

make ast builder public

c76dbae

introduce udlp unparser

2335276

add documents

640bc93

add examples

4a32991

add negative tests and fmt

35dac96

github-actions bot added the sql SQL Planner label Dec 22, 2024

goldmedal mentioned this pull request Dec 22, 2024

Support unparsing LogicalPlan::Extension to SQL tesxt #13753

Open

fix the doc

85fb3a4

goldmedal marked this pull request as ready for review December 22, 2024 12:23

sgrebnov reviewed Dec 22, 2024

View reviewed changes

sgrebnov approved these changes Dec 22, 2024

View reviewed changes

phillipleblanc reviewed Dec 23, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Introduce `UserDefinedLogicalNodeUnparser` for User-defined Logical Plan unparsing #13880

Introduce `UserDefinedLogicalNodeUnparser` for User-defined Logical Plan unparsing #13880

goldmedal commented Dec 22, 2024 •

edited

Loading

goldmedal commented Dec 22, 2024

sgrebnov Dec 22, 2024 •

edited

Loading

phillipleblanc Dec 23, 2024

sgrebnov Dec 22, 2024

phillipleblanc Dec 23, 2024

sgrebnov commented Dec 22, 2024

phillipleblanc left a comment

phillipleblanc Dec 23, 2024

phillipleblanc Dec 23, 2024

phillipleblanc Dec 23, 2024

phillipleblanc Dec 23, 2024

phillipleblanc Dec 23, 2024

phillipleblanc Dec 23, 2024

Introduce UserDefinedLogicalNodeUnparser for User-defined Logical Plan unparsing #13880

Are you sure you want to change the base?

Introduce UserDefinedLogicalNodeUnparser for User-defined Logical Plan unparsing #13880

Conversation

goldmedal commented Dec 22, 2024 • edited Loading

Which issue does this PR close?

Rationale for this change

What changes are included in this PR?

Are these changes tested?

Are there any user-facing changes?

goldmedal commented Dec 22, 2024

sgrebnov Dec 22, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

sgrebnov commented Dec 22, 2024

phillipleblanc left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Introduce `UserDefinedLogicalNodeUnparser` for User-defined Logical Plan unparsing #13880

Introduce `UserDefinedLogicalNodeUnparser` for User-defined Logical Plan unparsing #13880

goldmedal commented Dec 22, 2024 •

edited

Loading

sgrebnov Dec 22, 2024 •

edited

Loading